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This paper aims to generalize and unify classical criteria for com- 
! parisons of balanced lattice designs, including fractional factorial de- 

XJ} • signs, supersaturated designs and uniform designs. We present a gen- 

eral majorization framework for assessing designs, which includes a 
stringent criterion of majorization via pairwise coincidences and flex- 
ed 1 ible surrogates via convex functions. Classical orthogonality, aber- 
ration and uniformity criteria are unified by choosing combinatorial 
and exponential kernels. A construction method is also sketched out. 



\ 1. Introduction. We consider three types of balanced lattice designs in- 

cluding the fractional factorial design (FFD) [5], the supersaturated design 
. (SSD) [1, 16] and the uniform design (UD) [10]. These have been widely 

CO ' used in agriculture, industry, scientific investigations and computer exper- 

■ iments, since a good design cannot only reduce experimental cost but also 
provide more efficient parameter estimation. Among many criteria for opti- 

"p^ ' mum factor assignment, minimum aberration [12, 17, 24, 26] considers the 

^— > ■ confounding situation between treatment effects under ANOVA decompo- 

sition; E(s 2 ) [1] and Ave(x 2 ) [27] measure two-factor orthogonality combi- 
natorially; the discrepancy [14] considers the estimation of the overall mean 
from a multivariate quadrature perspective. The criteria are derived from 
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different principles, which may confuse users seeking to choose a suitable 
criterion for a specific experiment. Two natural questions are whether these 
individual criteria are connected and further, whether they can be unified 
into a single framework. 

This paper aims to establish such a framework using majorization tech- 
niques. Majorization theory is appealing not only for its simplicity in con- 
cept, but its usefulness in many diverse fields; see [19] for a complete account. 
It was used as a tool in the early study of Kiefer's optimality criteria for opti- 
mal designs by considering eigenvalues of the Fisher information matrix; see 
[2, 22] and references therein. However, there has been little application to 
lattice designs, until the recent work of Cheng and Mukerjee [3] and Cheng, 
Steinberg and Sun [4] on estimation capacity, as well as that of Fang and 
Zhang [11] on projection aberration. In this paper we apply majorization 
theory to pairwise coincidences of experimental runs in order to study min- 
imum aberration, discrepancy and some supersaturated design criteria. The 
lower bounds of these criteria will be provided in a unified way. 

The paper is organized as follows. In Section 2 the majorization framework 
is proposed for balanced lattice designs, together with a two-stage investiga- 
tion scheme through an illustrative example. Section 3 is devoted to unifying 
classical criteria surveyed above. In Section 4 an algorithm will be sketched 
out for constructing designs under the new framework. Technical proofs are 
given in the Appendix. Throughout this paper we use |u| to denote the car- 
dinality of a set u. The function (p = if x < j and — x(x — 1) • • • {x — j + 1) 
otherwise. The Kronecker delta 5(x,y) = 1 if x = y and is otherwise. 

2. Majorization framework. Consider experiments of s factors each hav- 
ing q levels. A lattice design with n runs is a set of n points chosen from the 
lattice space C(q s ), the s-fold tensor product of the integer set {0, 1, . . . , q — 
1}. Each coordinate of C(q s ) corresponds to a factor. It is balanced (or 
U -type) when the q levels appear equally often for each factor. The set 
of balanced designs is written as ^(n,q s ). Either the fractional factorial 
design with resolution- (t + 1) or the orthogonal array OA(n,s,q,t) lies in 
^(n,q s ), provided that, for any t columns, all the possible level combi- 
nations appear equally often. The uniform designs are constructed from 
^(n,q s ). The orthogonal designs have strength t > 2 and they are saturated 
if n = 1 + s(q — 1); otherwise, the orthogonality is not attainable, as in su- 
persaturated designs. For design selection, let 3>(n,q s ) denote the space of 
competing designs, which is restricted in this paper to be either <ft (n, q s ) or 
its subset. 

Of our primary interest is the coincidence measurement between lat- 
tice points, which, together with its counterpart Hamming distance, plays 
an important role in the studies of designs and codes. For any x,w € 
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C(q s ), the coincidence /3(x, w) '=Y^ S j=i&xj,w m terms of the Kronecker 
delta. It follows that /3(x,x) = s and /3(x, w) = /3(w,x). For a lattice de- 
sign X(n, written as an n x s matrix with entries Xy from {0, 1, . . . ,q — 
1}, define its pairwise coincidence (PC) vector /3(X) := (fix, /?2, • • • ,(3m)' 
by collecting /^(x^Xfc) for 1 < i < k < n consecutively, where m = n{n — 
l)/2 and /3(xj,Xfc) = {3 n (i-x)+k-i(i+i)/2- We call two lattice designs PC- 
different if their PC-vectors cannot be exchanged by permutation. For iso- 
morphic designs that are equivalent after reordering runs, permuting co- 
ordinates or switching levels, they hold the same increasing order statistic 
of PC-vector. The PC-different designs are nonisomorphic. The PC-sum 
J2T=i Pr '■= J2i<k Sj=i $xij,x k j remains invariant in both isomorphic and non- 
isomorphic balanced designs, by observing that 1 + J2k^i^x i:j ,x kj = n/q for 
any i,j. 

Lemma 1. For any X £ <%f(n,q s ), its PC -sum is — 1). 

Let us now briefly review the majorization theory of Marshall and Olkin 
[19]. For a nonnegative vector x 6 M.™, denote its increasing order statistic 
by sji] — x [2] ^ ' " ' ^ x [m] ■ We say x is majorized by y and write x ^ y if 

k k mm 

(2.1) X^M-X^M' fc = l,2,...,m-l and ^]x r = ^y r . 

r=l r=l r=l r=l 

If there exists at least one strict inequality J2r=i x [r] > J2r=i VM f° r some k, 
we write x -< y strictly. A real- valued function : — > K is called Schur- 
convex if ^(x) < ^(y) for every pair x, y € with x ^ y. Necessarily, ^(x) 
is symmetric in its arguments, that is, invariant under permuting Xx, . . . , x m . 
We are mainly interested in the following separable convex class of Schur- 
convex functions: 

m 

^(x) = ^^(x r ), ip is convex on R + , 

r=l 

as well as their monotonic mapping g(^(x)) for some g. Hardy, Littlewood 
and Polya (HLP) [13] derived the following equivalent condition; or see 
page 108 of [19]. 

Lemma 2 (HLP). The inequality ^(x) < <I>(y) holds for all separable 
convex functions ^ : W 71 — > R if and only if x ^ y . 

Consider the PC-mean of any balanced design X(re,g s ), which is a con- 
stant (3 = g^-ij by Lemma 1. For integer- valued /3(X) with length m, let 

P = {P,...JY and P = (9,.. .,0, 9 + 1,. ..,9 + 1) ', 

m m(l-f) mf 
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where 6 and / are the integral part and fractional part of j3, respectively. It 
is clear that (3 < (3 ^ /3(X), where (3 reduces to /3 when / = 0. By Lemma 2 
we have a generalized version of Lemma 5.2.1 of Dey and Mukerjee [5]. 

Lemma 3. For integers Pi,...,(3 m with mean (3, any separable convex 
function J2™=i V'(A-) has a tight lower bound m(l — f)^{9) + mfil)(6 + 1), 
where 6 and f are the integral part and fractional part of (3, respectively. 

Based on decision theory and majorization theory, we propose a general 
framework for balanced designs via their PC measurements. 

Defintion 1 (Majorization framework). In the space f^C %(n,q s ) of 
competing designs, we define the following: 

1. X is inadmissible if there exists X' s.t. their PC-vectors satisfy /3(X') -< 
0(X); 

2. X is majorant if /3(X) < /3(X) for all X' G Q\ 

3. X is Schur-V> optimal if *(X;V>) < V(X';ip) for all X' 6 9, where 

m 

Schur-V' criterion *(X; ip) := ^ ip(f3 r (X)) 

r=l 

is determined by a specified convex kernel function tj) : R + — > R. 

The three parts in Definition 1 can be divided hierarchically into two 
stages of investigation, namely, stringent majorization check and flexible 
Schur-convex comparison. At the first stage, for competing designs in @(n, q s ), 
compute their PC-vectors with elements sorted in increasing order. Compare 
the cumulative summations in the sense of the majorization ordering (2.1). 
By Definition 1.1, any inadmissible design should be prohibited for experi- 
mentation; by Definition 1.2, the majorant design(s) if it exists is the winner 
and absolutely recommended; otherwise, we need Definition 1.3 and go to 
the second stage for comparing admissible designs. The first stage is strin- 
gent since majorization requires strong conditions between PC-vectors. At 
the second stage, specify a convex kernel and compute the Schur--0 value for 
each admissible design. Since the above Schur-ip criterion is single-valued, 
all the designs are pairwise comparable and able to be rank-ordered. For 
different specific purposes, it is very flexible to predefine kernels, as long as 
they are convex functions. In the next section we shall discuss how to choose 
suitable kernels for investigating the orthogonality, aberration and unifor- 
mity properties of designs. Now let us illustrate the ideas with an example 
and some toy convex kernels. 
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Table 1 

Example of 27 -run three-level design, transposed X(27, 3 s ). Each row represents a factor, 
for which each level appears nine times. The 70 ^-factor sub-designs are of interest 

[7(27, 3 8 ) 1 2 3 4 5 6 7 8 9 10 11 12 13 14 15 16 17 18 19 20 21 22 23 24 25 26 27 

A 120200012102210202011121102 

B 120002221021201011112221000 

C 221111010021000221012102022 

D 112012102102220101102102020 

E 121122002020122200121110001 

F 000102101211010222222120011 

G 122201200000121121020122110 

H 202011201222200210020111101 



Example 1. For the purpose of illustration, consider the following sce- 
nario: an experiment of 27 runs with 8 factors each having 3 levels. The 
experimenter chooses a uniform design tabulated in Table 1, which was 
obtained by Fang, Ma and Winker [8]. The experimenter has some prior 
knowledge: among the 8 factors, 4 factors may have potential impact on 
the output, while the other 4 have little impact; he is interested in includ- 
ing all of them in the study. To incorporate such prior information into the 
design of the experiment, he wants to choose a sub-design consisting of 27 
runs with 4 factors for the 4 potential factors. This leads us to study the 
following problem: how to choose the 4-factor sub-design from Table 1? 

There are in total ( 4 ) = 70 choices of sub-designs from this table, which are 
all balanced and form the design space 0(27, 3 4 ) of this study. For demon- 
stration, 4 sub-designs from the design space @ are chosen through Xi = 
{A, C, G, H}, X 2 = {B, C, G, X 3 = {A, B, D, F} and X 4 = {A, D, E, F} 
labels of factors. Let us make a two-stage investigation of & under the ma- 
jorization framework: 

• Stage 1: stringent majorization check. 

For each sub-design, its PC-vector has length 351 and sum 972. By ma- 
jorization ordering, not all 70 sub-designs can be compared. There exists 



Table 2 

Numerical results of Schur-convex comparison for Xi to X4 from S>(27, 3 4 ). The lower 
bounds in the last column are derived from Theorem 1 (where denotes ) 



Convex kernel 


*(X;V) 


Xi 


x 2 


x 3 


x 4 


Lower bound 


Variance 




0.6391 


0.6391 


0.6732 


0.6789 


0.1775 


Power 




1658.7 


1724.5 


1765.5 


1790.4 


984.8 


Exponential 




683.4 


685.6 


687.9 


688.5 


648.9 



G 
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where both X3 and X4 are inadmissible (even though X3 overwhelms 
X4), but the admissible Xi and X2 are not distinguishable at this stage. 
• Stage 2: Schur-convex comparison. 

Let us choose three toy kernels for comparisons, namely, a variance kernel 
ipi(x) = m~ 1 (x — x) 2 , a 7rth-power kernel ip2( x ) = x n and an exponential 
kernel based on the golden ratio tp3 (cc) = ( l+ ^ ) x . Numerical results of 4 
sub-designs are shown in Table 2. Their Schur-?/; values are rank-ordered 
as 



where the equality holds only for ip\ . It is shown that inadmissible X3 , X4 
always have large Schur-?/> values no matter what convex kernel is used. 

Under the classical criteria, both Xi and X2 are orthogonal designs of 
resolution 3; their (generalized) word-length patterns are given by 



and the wrap-around L2-discrepancy values are given by WL/2(Xi) = 0.4242 
and WL 2 (X 2 ) = 0.4245. We find that in the complete pool 0(27, 3 4 ) of 70 
competing designs, Xi is not only an FFD with minimum aberration but 
also a UD with minimum WL2-discrepancy. 

The above example demonstrates both stringency and flexibility of the 
majorization framework for assessing designs. The kernel selection problem 
at stage 2 is discussed in some detail by Zhang [28], who also explains why 
Xi and X2 are not distinguishable under the variance kernel. Formally, we 
have the following main theorem to characterize the necessary and sufficient 
conditions between majorant designs and Schur-^ optimum designs, accord- 
ing to Lemma 2. We also employ Lemma 3 to derive the lower bounds for 
specific Schur-'0 criteria. 

Theorem 1. A balanced lattice design is majorant if and only if it is 
Schur-ip optimum w.r.t. every convex kernel. For any well- defined Schur-ip 
criterion, it has a lower bound m{\ — f)ip{9) + mfip(6 + 1). 

The lower bound is presented for general PC-mean /?, either integer- 
valued or not. Obviously, if ^Efj ^ s a positive integer, / = and the lower 

bound reduces to mip{(3). This bound is attainable if there exists an equidis- 
tant design X in (n, q s ) such that all the Hamming distances between dis- 
tinct runs are identical, that is, PC(X) = (3. Equidistant designs are a typical 



*(Xi;^-) < *(X 2 ;Vi) < *(X 3 ; V;) < *(X 4 ;^) 



forj = 1,2, 3 



GWP(Xi) = (0,0,10/9,8/9) 



GWP(X 2 ) = (0,0,46/27,20/27) 
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type of majorant design, and examples are two-level SSDs constructed by 
the half-fraction Hadamard method [16], multi-level SSDs constructed from 
resolvable balanced incomplete block designs [6] and saturated OA(n, s,q,2) 
designs whose /3(xj,Xfc) = s — n/q for any i^k [20]. If ^fej~n returns a non- 
integer, the lower bound by Theorem 1 can be achieved by weak equidistant 
designs with j3 whose elements differ at most by 1 . Examples can be obtained 
by either adding a balanced factor to or removing a factor from saturated 
designs. Note that the bound is tight in some cases but not generally tight 
under all parameter (n,s,q) settings. 

3. Unification of classical criteria. The design criteria for FFD, SSD and 
UD are discussed in this section. The majorization framework and, in par- 
ticular, the flexible Schur-^ criteria based on combinatorial and exponential 
kernels are used to unify the criteria of minimum aberration and discrepancy. 
By Theorem 1, their lower bounds are generated automatically. Throughout 
this section 9 is the integral part of (3 [in particular, (3 = for X(n,g s )] 

and f = (3-6. 

3.1. Fractional factorial designs. FFD is an important experimental strat- 
egy and usually measured by the minimum aberration criterion originally 
proposed by Fries and Hunter [12] for regular designs. We rely on its gener- 
alization by Xu and Wu [26] for both two- and multi-level, both regular and 
nonregular designs. Based on the ANOVA decomposition model, define for 
X(n,q s ) 

(3.1) ^■(X):=-^||G J |^EE-^trace(GfG J ), j = l,...,s, 

where Gj is the matrix consisting of all j-factor contrast coefficients (|| • 
Frobenius norm; H : conjugate transpose). The (generalized) word-length 
pattern (GWP) is defined by (A±, . . . ,A S ), in which A± = for balanced 
designs. For two such patterns x,y E define a partial ordering |= as 
follows. We write x h y if the first nonzero element of x — y is negative, 
and write x |= y if x h y or x = y. An FFD has minimum aberration if its 
GWP achieves the minimum under \=. Ma and Fang [17] and Xu and Wu 
[26] connected word-length pattern with Mac Williams' transform of distance 
distribution in coding theory, 

(3.2) A j (X) = -J2E l (K)P j (l;s,q), j = l,...,a, 

n 1=0 

where Ei(X) = n _1 |{(xj,x fc ) : /3(xj,x fc ) = s-l, i, k = 1, . . . , n}\ for I = 0, . . . , s 
and 

P^s,q)=j2(-ir(q-iy^( X )( S - X w ) 
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are Krawtchouk polynomials ([18], Section 5.7). Clearly, Aj(X) can be ex- 
pressed as ^Y%LiPj(, s ~ Pr',s,q) + (j) ■ To unify the minimum aber- 
ration through Schur-^ criterion, a direct idea is to use Krawtchouk polyno- 
mials, P2(s — x;s,q) to P s (s — x;s,q). However, the function Pj(s — x;s,q) 
is not generally convex except for j = 2, which implies that it is trivial to 
unify A2 and find its lower bound, but nontrivial for higher-order ASs. 

Let us make an indirect approach by a series of combinatorial functions. 
For X(n,g s ), define the Schur- combinatorial criterion of affine form, 

(3.3) * C (X; j) := 2 £ ) - ( S . ) - n) , j = l,...,s, 

which are all separable convex on M.™. The criterion can be interpreted sta- 
tistically as follows. Consider ^c(X;s) first. Randomly on C{q s ), each x 
has uniform probability n/q s of entering the n-point design. Let iV x be its 
true occurrences in X(n,g s ). The value (iV x — n/q s ) 2 measures the varia- 
tion of the design centering x. For q s different points, the total variation 
Sxg£(<? s )(-^x — n /Q s ) 2 is therefore a measure of uniform covering, which 
equals \&c(X;s). Formally, we have: 

Theorem 2. For X(n,g s ) and S = {1, . . . ,s}, the Schur- combinatorial 
criterion 

(3.4) * c (X;j)= Yl E ( N i u) -^) 2 forj = l,...,s, 



uCS ,\u\=j x€C(qi ) 



(u) 

where iV x counts the runs whose u- coordinates take level-combination x. 
Further, the design X has orthogonal strength t if and only if ^fc(X.;j) =0 
for 3 = l,...,t. 

Projection properties are taken into account in Theorem 2. Let X u de- 
note the u-coordinate sub-design. Thus, ^c(X;j) = J2\ u \=j ^c(X u ; j) sums 
up the measurements at all j-dimensional sub-spaces. Besides the geometri- 
cal meaning, ^c(X;j) measures the variation from the j-factor orthogonal 

strength. From (3.4), ^c(X; j) > 0, where equality holds if = n/qi , 
which occurs if and only if X is an orthogonal array of strength j. 

To use a relatively simple notation, define for X(n,<? s ) the root-mean- 
squared deviation criterion, 



\ £ (jV x -n/^ = J1* C (X; S ), 

\ q V I" 



(3.5) B S (X) 



as well as Bj(X.) := for j < s. Let us call (JBi, . . . , B s ) a de- 

viation pattern, which reduces to the projection ^-criterion for two- level 
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designs [23]. Note that Bt = implies that Bj = for j < t. Analogous 
to the word-length pattern, A\ = B\ = for balanced lattice designs, and 
A t = D t = 0, At+i > 0, Bt+i > for resolution-(i + 1) orthogonal designs. 

Theorem 3. For design X £ &(n,q s ), the deviation pattern and word- 
length pattern are linearly related by 

(3.6a) £f(X) = $t( S j Zi) M*) for J = I,..., s. 
Their benchmarks are given by (0, A\ , . . . , A*) and (0, , . . . , B*) , in which 
A* = (l - 1) ((1 - f)Pj(s - 9; s, q) + fPj(s -0-1; 8, q)) 

(3.6b) 

for j = 2,...,s, in the sense that (0,A%, . . . , A* s ) \= (0, A 2 (X), . . . , A S (X)) 
while Bj < Bj(X) for all j simultaneously. 

Checking the simplest case for balanced designs with integer-valued 
PC-mean /?, we get A2(X.(n, q s )) > s( - l? " 1 ^~^~ n+1 ^ , which is consistent with 

Fang, Ge, Liu and Qin [6] and Xu [25] for investigating supersaturated de- 
signs. 

3.2. Supersaturated designs. In the recent decade, SSDs, in most cases 
2-level factorials, have drawn much attention in screening experimentation 
due to their economic run size. Nonorthogonality criteria like E(s 2 ) and 
Ave(x 2 ) are used to evaluate/construct SSDs, as their orthogonal prop- 
erty is violated. For X(n,2 s ), let with (—1,1) entries represent the 
jth factor. Booth and Cox [1] originally defined E(s 2 ) by the mean inner- 
product s ( s 2 _ 1 ) J2i<j<i<s x Jj) x (i)- Let Nti,t2 be the number of runs whose 
(j, Z) factors take level-combination (t\,T2). Then we observe that XjT.xX^) = 
4En,7a=l( JV nS -n/4) 2 . For multi-level X(n,q s ), define 

Ave(x 2 ):= T ^ £ £ - n/, 2 ) 2 , 

S{S ' l<j<l<ST U T 2 = l 

which reduces to E(s 2 ) when q = 2 (after being multiplied by 4) and reduces 
to Yamada and Lin's [27] Ave(x 2 ) when q = 3 (after being multiplied by 
9/n). 
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By Theorem 2, we find that there is a natural link between Ave(x 2 ) 
and \I'c(X;2) based on the combinatorial kernel, as well as the deviation 
measure B2 on the 2D sub-space. For simplicity, we give a unification scheme 
through the quadratic kernel ip((3) = j3 2 and the associated Schur-^ criterion 
^(X;f3 2 ) :=J2T=iPri which is equivalent to Vl/c(X;2) for balanced designs. 

Theorem 4. The nonorthogonality criterion Ave(x 2 ) for SSD X € 
ty/ (n, q s ) satisfies 

Ave (* 2 ) = 7(7ri) * (X; 02) + a - W^Y) {62 + 26 f + f) + a > 

where the constant a = - ns+ Z ^ 1 ~ s ~ q ' 1 . 

The lower bound follows directly Theorem 1. When the PC-mean (3 is an 
integer, 

2 n 2 (s-n+l) 2 n 2 (g-l)((g-l) S -n-l) 

^ s >^ ( s _!)(„_!)' Ave UJ^ q 2( s -l)(n-l) 

where the lower bounds can be attained by optimum SSDs constructed from 
partial saturated designs, resolvable BIBDs or an algorithmic approach [6, 
16, 21, 27]. 

3.3. Uniform designs. UD is of space filling type and becomes more 
and more important for computer experiments. For n design points scat- 
tered into the lattice space, Fang and Wang [10] suggested using the star 
discrepancy as the uniformity measure, which corresponds to the famous 
Kolmogorov-Smirnov statistic for goodness-of-fit testing between .F n (x), the 
empirical distribution of the design, and -F*(x), the uniform distribution. A 
discrepancy defined in quasi-Monte Carlo methods can be viewed as a norm 
||.F n (x) — F„,(x)|| of some reproducing kernel Hilbert space [14]. For assess- 
ing qualitative factorial assignment, Hickernell and Liu [15] proposed the 
discrete discrepancy, which is a special case (when /j, = 0) of the categorical 
discrepancy below. 

Defintion 2 (Categorical discrepancy). On the lattice space C(q s ) and 
the set S = {1, . . . , s}, for each nonempty u QS, any x, w S C(q s ) and any 
design X with points xi, . . . ,x n € C{q s ), define the categorical type of hat 
reproducing kernel function and hat discrepancy 

K u (x, w) = Y[(b+(a- b)S Xj!Wj ), 

( 1 n \ 1/2 

\ i,k=l I 
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where a is a given positive constant (a < q — 1), 6 is chosen from [— -^j-, a) and 
/j, = Ma + (g — 1)6). Define the categorical discrepancy pattern (Z?i, . . . , D s ) 
and the categorical discrepancy -D(X;aV6), respectively, by 



Z^(X;aV&) = / £ Dg(X;l 
V l«H 

and 



D(X;aVl)) 



\ 



££>?(X;aV&), 

3=1 



where a V 6 denotes categorical assignments to the hat kernel 

The parameter constraints a > and — <b < a are set to ensure that 

the bivariate K u is nonnegative definite. For j = 1, . . . , s, -Dj(X; a V 6) sums 
up the hat discrepancies of all possible j-factor projection designs. Hickernell 
and Liu [15] showed that when the parameters satisfy a + (q — 1)6 = 0, 
the categorical discrepancy pattern under partial ordering |= is equivalent 
to the minimum aberration criterion. For X(n,q s ) under our majorization 
framework, by using the kernel ip(/3) = pr with base p > 1, we can define the 
Schur- exponential criterion ^e(X;p) := J2r=i P^ r ■ 

Theorem 5. For any lattice design X 6 ^(n,g s ), the squared categori- 
cal discrepancy is equivalent to the Schur- exponential criterion, 

(3.6g) ^(X;. v 6) -»!Si£i + _(! + „,., 

n z n 

where the exponential base p = (1 + a) /{I + b). It has lower bound 

D 2 (X; a V 6) > -((n - 1)(1 - / + pf)p d + (1 + a) s ) - (1 + ^) s . 
n 

The centered L2- discrepancy (CL2) and wrap-around L2- discrepancy (WL2) 
are popular uniformity measures for quantitative experiments; see [14] and 
[8] for the details. For modest-level designs, CL2- and WL2-discrepancies 
have similar properties to categorical discrepancy, that is, the reproduc- 
ing kernel values between distinct runs are determined by coincidence mea- 
surement. They correspond to the Schur-exponential criteria under different 
bases. 

Corollary 1. For X 6 %(n,q s ), the Schur-exponential criterion can 
cover 
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g = 2:WL^(X)-a 2 = ^ -L (-) > 



2™ _ _ 2v& E (X; 1.2) f 5 y ^ (n - 1)(5 + /) /5V (6\ e 

5n \4J \5J ' 



s . S! wiicx)-«-»!^m(| 

(n- l)(23 + 4/) /23\ s /27 



~ 23n V23, 

^ere a x = ±(f ) s + (±§) s - 2(f ) s and o 2 = ±(§) s - (§)*. 

The lower bounds derived above for CL/2-discrepancy and WL<2-discrepancy 
are tighter than [9] and [6] . 



4. Algorithmic construction. In our framework, majorization on pair- 
wise coincidences is conceptually simple. From a geometric point of view, it 
enforces pairwise coincidences spread as equally as possible, which is uni- 
versally applicable to various criteria discussed above. By the majorization 
idea, an optimization approach with heuristic searches is in development. 
Although this paper addresses mainly theoretical aspects of design criteria, 
we briefly sketch out our algorithmic construction method, in particular, its 
basic operation of Robin Hood swap. The swapping algorithm aims to take 
one unit from the /3-large pair of coincident runs and give it to the /3-small 
pair, analogous to the legend of Robin Hood. 

Algorithm 1 (Robin Hood swap pseudo-code). Given a convex kernel 
function tjj : R + — > K and a balanced lattice design Xef (n,g s ): 

Step 1: Compute the coincidence matrix M and find its maximal entry (en- 
tries); for each such pairwise coincidence fin. of runs (xj,Xfc), do 
steps 2 and 3. 

Step 2: Find the run(s) that has minimal coincidence from x». For each such 
run Xf , find the coordinates such that x%j = x^j , while xtj ^ xij 
for j e c €. 

Step 3: For each coordinate j € , find 3%i,Mt such that x W j = Xij , Vw6 £%i 
and x w j = xtj,Vw € Mt, respectively (i ^ St^t ^ 3%t). Compute the 
delta, 



- E i&iM+MM)- 
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Find the local minimum A* for j E 'tf. Record A* } if 

Al M) < 0. 

Step 4: Find from the record the global minimum A* and output {i*, t*, j*}. 

The algorithm works on a specific kernel function. For example, consider 
the quadratic kernel ^(x) = x 2 and the randomly generated balanced design 
X E ^(8,2 6 ) shown in Table 3 (left). Indicated by its coincidence matrix 
shown in Table 3 (center), the Robin Hood algorithm finally decides to 
swap the levels in the 4th factor between the first and last runs, in order 
to equalize pairwise coincidences. By such a single swap operation, we find 
that the PC-vector of the swapped design is majorized by the original PC- 
vector; see Figure 1 for the cumulative plots of sorted PC-vectors in the 
sense of (2.1). 

In Figure 1 the benchmark by Theorem 1 is also plotted, which has two 
slopes (corresponding to 9 and 6 + 1) rather than the dashed straight line, 
since (3 = 2.5714 for ^(8,2 6 ). Note that the benchmark can be attained by 
any 6-factor sub-design of the 8-run Hadamard design. The Robin Hood 
swap algorithm makes the random design move toward the benchmark. It- 
erative swapping can make it move closer. However, the above deterministic 
procedure often gets into the local optimum. Advanced stochastic optimiza- 
tion methods are therefore called for. Based on a similar column-wise swap, 
Fang, Lu and Winker [7] used the threshold accepting heuristic for construct- 
ing uniform designs. Our group is currently developing a similar heuristic 
based on Robin Hood swaps, which is beyond the scope of this paper. 

APPENDIX 

Proof of Theorem 2. Let us write the sum notation J2 u cs,\u\=j> 
J2xec(qi) as an d J2 X m short, respectively. Let S^' = 1 if the u-coordinate 

Table 3 

Robin Hood swap on X(8,2 6 ); at j* = 4, swap the levels between rows i* = 1 and t» = 8; 
only 12 entries (boldfaced ) are updated in the coincidence matrix 
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sub-tuples of Xj,Xfc take the same level combination and otherwise. It can 
be verified that 

u \ J / 

n 

Ei n, =E(^) 2 . 

i,k=l x 

and the Schur-combinatorial criterion (3.3) can be expressed as 
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n 

= EE4 U) = EE(^) 2 - 

i,k=l « « x 

By the method of variance decomposition, the right-hand side of (3.4) is 
given by 

E E - 5) 2 = E E ) 2 - 5 1&) + n2 



EE(^ } ) 2 



n 2 [ s 



q 3 \3 

which equals the left-hand side ^c(X.;j). □ 

Proof of Theorem 3. For the j-factor sub-design X w (n, g- 7 ) with 
u-coordinate factors, the word-length pattern (3.1) for 1 < k < j can be 
written as 

A k (Tt u ) = \ Y, IXv(X n )| 2 , 

Tl — 

«rf(v)=fc 

where {Xvj v € /^(g- 7 )} are given orthonormal contrasts and wt(v) is the 
number of nonzero elements of v [26]. Consider the deviation (3.5) of sub- 

design X u , B 2 (X U ) = ^jJ2i,k=i ( f ) _ where /% ; measures the coinci- 



dence between u-coordinate sub-tuples of Xj , and it cannot exceed j . So 
B 2 (X. U ) = ^i?o(X M ) — pj, where i?o(X„) is defined in (3.2). Since it is true 
that 

(3.6a) £J (X U ) = ^1 + E^( X «)J 

(verified at the end), it follows that B 2 (X U ) = & Yi=\ ^fc(X u ). 



For X(n,g s ) itself, ^(X) = ^/E| u | =j ^(XJ. Via { Xv , v € £(^)}, 

^( x )4eEw4eE E Ixv(xji 2 . 

y |n|=jfc=l y |u|=jfc=lwt(v)=fc 

Note that each contrast Xv(X u ) of the sub-design X n is also a contrast 
Xw(X) of X(n, g s ), where w coincides with v at u-coordinates and has null 
elements elsewhere. Denote by £l u the set of such w's. Then we have 

^ 2 (x) = -JjEE E ixw(x)i 2 

q k=l\u\=jweU n 



1 



J 



y fc=i VJ 7 wt(Mv)=k 
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By writing 4j E™t(w)=fc IXw(X)| 2 back to A k (X), (3.6a) is proved. The 
benchmark of deviation pattern follows from Theorem 1 directly. Since the 
word-length patterns are linearly related through (3.6a) with positive piv- 
oting coefficients, both patterns are equivalent under |=. We can therefore 
use (3 that determines the benchmark of deviation pattern to derive the 
benchmark for the word-length pattern. 

Let us now verify (3.6a) for X(n,g s ) through Krawtchouk polynomials. 
For an integer I (0 < I < s) and any real number y, Krawtchouk polyno- 
mials Pj(l;s,q) have the following property: J2j=o -^'(^ s > Q)y J = [1 + (l ~ 
l)y] s - l (l-y) 1 ([18], Section 5.7). By setting y = 1, we have Yfj=oPj(S>) s,q) = 
q s and £* =0 P j (I; s, q) = 0, for I = 1, . . . , s. By P (l; s,q) = l and £?=o E (X) = 
n, 



n 



j=i / i=i/=o 



i=o \j=o / 



n 



+ q a E (X) - £ Ei(X) = q s E (X). 



1=0 □ 



Proof of Theorem 5. In Definition 2, there is no risk in letting K = 1 
and D (^;M. ) = 0. By the expansion of tensor products and coincidence 
measurements, 

D 2 (X;aV6) = - J2 M H + 4 E E II i b + ( a ~ WW 

0CuC5 0CuCSi,k=ljeu 
n s 



i,fc=lj=l 

1 n 

-(1 + fi) s + — E (1 + a)^**'**^! + b) s ~^ Xk) 



^f:(i±|) ;; + (Lt^_ (1+M) .. 

n z ^— i V 1 + o / n 

r=l 

By letting /? = (1 + a)/(l + 6) , the identity (3.6g) follows. Provided that 6 < a 
and a < g — 1, and the exponential base 1 < p < oo, we can use Theorem 1 
to get the lower bound for categorical discrepancy. □ 
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